Forward-Decoding Kernel-Based Phone Sequence Recognition
Abstract
Forward decoding kernel machines (FDKM) combine large-margin classifiers with hidden Markov models (HMM) for maximum a posteriori (MAP) adaptive sequence estimation. State transitions in the sequence are conditioned on observed data using a kernel-based probability model trained with a recursive scheme that deals effectively with noisy and partially labeled data. Training over very large datasets is accomplished using a sparse probabilistic support vector machine (SVM) model based on quadratic entropy, and an on-line stochastic steepest descent algorithm. For speaker-independent continuous phone recognition, FDKM trained over 177,080 samples of the TIMIT database achieves 80.6% recognition accuracy over the full test set, without use of a prior phonetic language model.
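The forward-decoding step described in the abstract can be illustrated with a minimal sketch. It assumes the state posterior is updated by a forward recursion of the form alpha_i[n] = sum_j P(i | j, x[n]) * alpha_j[n-1], with the MAP state taken as the argmax at each frame. The function names, the Gaussian kernel, and the softmax normalization are illustrative assumptions; in particular, the softmax stands in for the quadratic-entropy probabilistic SVM model of the paper and is not the authors' method.

```python
import numpy as np

def rbf_kernel(x, support, gamma=0.5):
    """Gaussian kernel between one observation x (d,) and support vectors (m, d)."""
    diff = support - x
    return np.exp(-gamma * np.sum(diff * diff, axis=1))

def transition_probs(x, support, coeffs, bias):
    """Hypothetical P[i, j] = P(state i at n | state j at n-1, x).

    coeffs has shape (S, S, m): one kernel expansion per (i, j) pair.
    A softmax over the destination state i is an assumed stand-in for
    the quadratic-entropy normalization mentioned in the abstract.
    """
    k = rbf_kernel(x, support)                    # (m,)
    scores = coeffs @ k + bias                    # (S, S)
    scores -= scores.max(axis=0, keepdims=True)   # numerical stability
    p = np.exp(scores)
    return p / p.sum(axis=0, keepdims=True)       # each column sums to 1

def forward_decode(X, support, coeffs, bias, alpha0):
    """Run the forward recursion over frames X (T, d); return MAP path and final alpha."""
    alpha = np.asarray(alpha0, dtype=float)
    path = []
    for x in X:
        P = transition_probs(x, support, coeffs, bias)
        alpha = P @ alpha          # alpha_i[n] = sum_j P[i, j] * alpha_j[n-1]
        alpha /= alpha.sum()       # guard against numerical drift
        path.append(int(np.argmax(alpha)))
    return path, alpha
```

Because the transition matrix is recomputed from each observation, decoding needs only the current frame and the previous posterior, so it can run causally without a backward pass.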
Similar resources
Forward-Decoding Kernel-Based Phone Recognition
Forward decoding kernel machines (FDKM) combine large-margin classifiers with hidden Markov models (HMM) for maximum a posteriori (MAP) adaptive sequence estimation. State transitions in the sequence are conditioned on observed data using a kernel-based probability model trained with a recursive scheme that deals effectively with noisy and partially labeled data. Training over very large data s...
Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterances into transcriptions. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
Sub-Microwatt Analog VLSI Support Vector Machine for Pattern Classification and Sequence Estimation
An analog system-on-chip for kernel-based pattern classification and sequence estimation is presented. State transition probabilities conditioned on input data are generated by an integrated support vector machine. Dot product based kernels and support vector coefficients are implemented in analog programmable floating gate translinear circuits, and probabilities are propagated and normalized u...
Project Summary Report the Catalyst Foundation -svm and Fdkm Have Demonstrated State-of-art Performance on Various Signal Processing Tasks in Speech and Image Recognition
Driven by the proliferation of portable devices like cellular phones, personal digital assistants (PDAs) and smart wrist watches, there has been an ever-increasing demand for efficient and robust user interfaces. An intelligent speech interface offers an attractive alternative to other means of communication and provides hands-free communication with these portable devices. Miniature handheld an...
Design and Implementation of Ultra-Low Power Pattern and Sequence Decoders
A key challenge in embedding pattern recognition intelligence onto ubiquitous sensing and communication interfaces in wireless integrated systems is to balance requirements on precision, complexity and power consumption in VLSI implementation. This dissertation investigates architectures for adaptive pattern recognition and sequence decoding, derived from statistical learning theory and Bayesia...